Leroy Ozores
The FIFA World Cup! One of the most exciting tournament in the world that occurs every four year (in my opinion). The World Cup is between national teams from all over the world.
The data that I will be using comes from FIFA, “Fédération Internationale de Football Association”, which translates to, International Federation of Association Football in english. This dataset collects data from 900 World Cup matches that was played in its history from Uruguay in 1930 to Russia in 2018. Using data from Kaggle that includes the top ranks of the tournament, stage of the tournament, home/away team, outcome of the match, and more. My approach to this data is to make analysis and insights of all the World Cup tournamnet matches. My goal is to see if I can uncover new information in the data that isn’t clear for viewers.
## # A tibble: 2 × 6
## year stage date home_team away_team outcome
## <dbl> <chr> <date> <chr> <chr> <chr>
## 1 1930 Group 1 1930-07-13 France Mexico H
## 2 1930 Group 4 1930-07-13 Belgium United States A
## # A tibble: 1 × 6
## year stage date home_team away_team outcome
## <dbl> <chr> <date> <chr> <chr> <chr>
## 1 2018 Final 2018-07-15 France Croatia H
How popular is the FIFA World Cup? How many attended? Which year had the highest amount of audience?
| year | host | attendance | games | winner |
|---|---|---|---|---|
| 1994 | USA | 3568567 | 52 | Brazil |
| 2014 | Brazil | 3441450 | 64 | Germany |
| 2006 | Germany | 3367000 | 64 | Italy |
Each tournament is exciting, not knowing which National Team will take the title. I wouldn’t know which team has taken the most titles. Lets have a look.
Every match is exciting, especially when your team scores. Every one standing up screaming Gol, bumping to each other and even spealing our drinks to our neighbor. How many gols screams are there per game?
## # A tibble: 900 × 16
## year stage home_team away_team home_score away_score Total_Gols country
## <dbl> <chr> <chr> <chr> <dbl> <dbl> <dbl> <chr>
## 1 1930 Group 1 France Mexico 4 1 5 Uruguay
## 2 1930 Group 4 Belgium United Stat… 0 3 3 Uruguay
## 3 1930 Group 2 Brazil Yugoslavia 1 2 3 Uruguay
## 4 1930 Group 3 Peru Romania 1 3 4 Uruguay
## 5 1930 Group 1 Argentina France 1 0 1 Uruguay
## 6 1930 Group 1 Chile Mexico 3 0 3 Uruguay
## 7 1930 Group 2 Bolivia Yugoslavia 0 4 4 Uruguay
## 8 1930 Group 4 Paraguay United Stat… 0 3 3 Uruguay
## 9 1930 Group 3 Uruguay Peru 1 0 1 Uruguay
## 10 1930 Group 1 Argentina Mexico 6 3 9 Uruguay
## # ℹ 890 more rows
## # ℹ 8 more variables: city <chr>, outcome <chr>, win_conditions <chr>,
## # winning_team <chr>, losing_team <chr>, date <date>, month <chr>,
## # dayofweek <chr>
## # A tibble: 81 × 2
## home_team home_score
## <chr> <dbl>
## 1 Brazil 177
## 2 Argentina 114
## 3 West Germany 93
## 4 France 85
## 5 Italy 72
## 6 Germany 65
## 7 England 50
## 8 Hungary 49
## 9 Spain 49
## 10 Belgium 45
## # ℹ 71 more rows
## # A tibble: 36 × 5
## year stage home_team away_team outcome
## <dbl> <chr> <chr> <chr> <chr>
## 1 1930 Semifinals Argentina United States H
## 2 1930 Semifinals Uruguay Yugoslavia H
## 3 1934 Semifinals Czechoslovakia Germany H
## 4 1934 Semifinals Italy Austria H
## 5 1938 Semifinals Brazil Italy A
## 6 1938 Semifinals Hungary Sweden H
## 7 1954 Semifinals Austria West Germany A
## 8 1954 Semifinals Hungary Uruguay H
## 9 1958 Semifinals Brazil France H
## 10 1958 Semifinals Sweden West Germany H
## # ℹ 26 more rows